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Abstract 

The problem of optimally locating sensors on a traffic network to monitor flows has been object of 
growing interest in the past few years, due to its relevance in the field of traffic management and 
control. Sensors are often located in a network in order to observe and record traffic flows on arcs 
and/or nodes. Given traffic levels on arcs within range or covered by the sensors, traffic levels on 
unobserved portions of a network can then be computed. In this paper, the problem of identifying a 
sensor configuration of minimal size that would permit traffic on any unobserved arcs to be exactly 
inferred is discussed. The problem being addressed, which is referred to in the literature as the Sensor 
Location Problem (SLP), is known to be NP-complete, and the existing studies about the problem 
analyze some polynomial cases and present local search heuristics to solve it. In this paper we further 
extend the study of the problem by providing a mathematical formulation that up now has been 
still missing in the literature and present an exact branch and bound approach, based on a binary 
branching rule, that embeds the existing heuristics to obtain bounds on the solution value. Moreover, 
we apply a genetic approach to find good quality solutions. Extended computational results show 
the effectiveness of the proposed approaches in solving medium-large instances. 
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1 Introduction 


Monitoring flows on the network is an important topic in the field of traffic management and control. 
The continuous growth in the demand for private transportation in large urban areas is the cause of 
severe congestion, pollution, time loss in traffic jams and a deterioration in the quality of life. Monitoring 
flows on a network allows traffic managers to control and manage these problematic situations. Even 
though communication technologies for monitoring traffic networks in real time, via sensors and video 
cameras, are currently available, in most cases we have a very large network which is monitored only 
in small part. In this context, partial information on traffic flows (obtained, for example, by located 
sensors on the network), is often used to estimate flows in the network that are not directly observed. 
However, recent studies in the literature show that by properly locating sensors on the network it is 
possible to exactly compute, under the assumption of error free data, the entire set of unobserved flows. 
In this context it is of paramount importance to design optimal strategies to determine sensor locations, 
and the problem of locating sensors on the network has been object of growing interest in the past 
few years. Problems in this class are differentiated according to the type of sensors that need to be 
located (counting sensors, path-ID sensors, vehicle-ID sensors or a combination of them) and flows of 
interest (origin/destination flow volumes, arc flow volumes, route flow volumes, or a combination of 
them). Counting sensors can be considered all those types of sensors that are able to count vehicle on 
a lane(s) of a road (for example conventional inductive loop sensors). They can be located on an arc (a 
vertex) of the network and count the number of vehicles on the arc (on the arcs incident to the vertex) 
during a particular time interval. Path-ID sensors are assumed to be devices that, when located on an 
arc of a network, can measure the flow volume of each route to which that arc belongs. This is the class 
of sensors that de-code active transmission provided by tagged vehicles, for example, freight information 
from trucks or path/schedule information from buses. Vehicle-ID sensors are sensors through which a 
vehicle can be univocally identified while traveling on the network. Licence plate readers or Automatic 
Vehicle Identification (AVI) readers are examples of sensors that belong to this class. 

Following the classification defined in the recent surveys by Gentili and Mirchandani [14]-[15], two 
classes of problems can be identified: (i) locating sensors to fully observe flow values (either arc flows, 
route flows or OD flows) on the network (Sensor Location Flow-Observability Problems), and (ii) locat- 
ing sensors to estimate flows (either arc flows, route flows or OD flows) on the network (Sensor Location 
Flow-Estimation Problems). This dichotomy is derived from the observation that the location of sensors 
on a network (either on the vertices or on the arcs) can be translated into a system of linear equations 
where the set of variables corresponds to the unknown flows and the set of equations comes from the 
deployed sensors. When the resulting system has a unique solution, we say the system is fully observable, 
and therefore all the flows involved in the system are known (that is, they are observable) . The resulting 
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location problem consists of determining the optimum deployment of sensors on the network that results 
in an observable system (Sensor Location Flow-Observability Problems). On the other hand, when the 
system is underspecified, it admits an infinite number of solutions. The related location problem consists 
of determining how to optimally deploy sensors on the network so that the derived flow estimates are as 
good as possible. Generally, underspecified systems arise when one is interested in determining origin- 
destination flow volumes by locating counting sensors on the arcs of the network. This problem has been 
extensively studied in the literature (see for example, Chootinan et al. [10], Elhert et al. [12], Yang et al. 
[18] Kim et al. [19], Lam and Lo [20], Yang and Zhou [23]). On the other hand, observability problems 
arise, for example, when either path-ID sensors or vehicle-ID sensors need to be located on the arcs of 
the network to determine route flow volumes (Gentili and Mirchandani [15]- [16], Castillo et al. [6], [7], 
[8], Cerrone et al. [9]). 

In this paper we focus on the observability problem arising when counting sensors are located on the ver- 
tices of the network and arc flow volumes need to be computed. Specifically, we are interested in locating 
the minimum number of counting sensors on the vertices of a network to compute arc flow volumes on 
the whole network (we refer to this problem as the Sensor Location Problem [SXP]). 

There is an extensive literature related to facility location on networks and, in particular, to the Flow 
Interception Problems. The problem addressed in this paper could seem similar to this well known class 
of problems; however, there is a huge difference. In flow interception problems a set of facilities is to be 
located on the network (generally on the arcs of the network) to intercept flows such that a given function 
of the flow is optimized (e.g., total intercepted flow is maximized [17] or total risk reduction is maximized 
[13]). Any subset of arcs (or vertices) that is selected is feasible for the problem. The only similarity 
between the SLP and this class of problems is the fact that once a facility (a sensor) is located on the 
network, a flow is intercepted. The main issue that makes the SLP unique and different from the flow 
interception problems is that the location of the facility has to be such that from the directly intercepted 
flows all the non-directly intercepted flows on the remaining arcs of the network can be computed. Hence 
the network becomes fully observable in terms of arc flows. 

The Sensor Location Problem was formally stated by Bianco et al. [5], where two heuristics giving 
lower and upper bounds on the solution value were presented, and a necessary condition for feasibility 
was stated. Successively, Bianco et al. [4] developed a combinatorial analysis of the problem and studied 
its computational complexity considering different special cases. Moreover, some graph classes, where the 
problem is polynomially solvable, were also presented. Morrison and Martonosi [21] and Morrison et al. 
[22] addressed the problem on bi-directed trees, giving a necessary and sufficient condition for a subset of 
vertices to be a feasible solution of the problem. They also defined a matrix reduction procedure to test 
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feasibility of a subset of vertices. Confessore et al. [11] further studied the problem and presented new 
heuristic algorithms and approximation algorithms. The quality of the solutions provided by the exist- 
ing algorithms is evaluated by comparison with data-dependent approximation bounds. Indeed, for the 
problem being considered there does not exist any exact approach providing (at least for small instances) 
the exact solution value, nor does there exist a mathematical formulation. 

In this paper we further extend the study of the problem by (i) developing and testing an exact ap- 
proach, based on a branch and bound scheme, to optimally solve the problem, (ii) developing and testing 
a genetic solution algorithm to get near-optimal solutions, and, (iii) providing a mathematical formula- 
tion of the problem based on the concept of MB-paths introduced in [4] . The mathematical formulation 
provided is a flow based formulation whose optimal solution provides a lower bound on the optimum 
solution value of SLP. We point out here that this is the first attempt to formulate this problem whose 
mathematical model was still missing in the literature. We tested our approaches on the set of bench- 
mark instances existing in the literature, on new instances we developed to extend the test cases for the 
problem and on a real world network. Our results show the efficiency of the proposed approaches in 
solving medium-large instances. 

The paper is organized as follows. Section 2 describes the problem being addressed and details the 
results in the existing literature that will be used in the present study. Our mathematical formulation is 
given in Section 3. Section 4.1 describes our genetic approach, while our Branch and Bound algorithm 
is described in Section 4.2. Computational results are discussed in Section 5 and Section 6. Conclusions 
and further research are the subject of Section 7. 

2 Problem description and existing results 

In this section we describe in detail the problem being addressed. For this entire discussion we assume 
that data are error- free. Given a directed graph G = ( N , A ), where N is the set of n vertices and A is the 
set of m arcs, the flow on each arc contains subflows that are generated and/or absorbed from different 
origin/destination pairs. Among the set of vertices, we say v is an Origin/ Destination (OD) vertex if 
the flow is generated and/or absorbed by it. The set of OD vertices of G is denoted by B C N. An OD 
vertex can be either the origin or the destination of flows or both, and it can also be used as a transfer 
node. Each OD vertex can send flow to or receive flow from any other OD vertex. 
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For each vertex v that is not an OD one, the flow conservation constraints hold: 


E /«.•- E /^ = ° (!) 

weFS(v) weBS(v) 

where FS(v) and BS(v) are the outgoing and incoming arcs of vertex v, respectively, and f VjW is the flow 
volume on arc ( v,w ). 

For each OD vertex v £ B, we have the following flow conservation constraint: 

V £ /«,. = «» P) 

WCFS(v) weBS (v) 

where S v ± 0 is the balancing flow at v, that is, a source or a sink flow so that (2) holds. If split ratios* 
associated with the arcs of the network are also known we could define additional relationship involving 
the flows. Indeed, by using split ratios, we can express the total outgoing flow F(v) as a function of the 
flow volume of any outgoing arc. Formally, for each v € N and each outgoing arc (v,w), by using split 
ratios p VfW at arc (v,w), we have: 

fv, w = F(v) ■ p v , w (3) 

From (3) and considering any other outgoing arc of v, say (v,z), we obtain: 



To better understand the definitions introduced before, refer to the example network in Figure 1 with 
6 vertices and 14 arcs. Table 1 gives the unknown arc flows on each arc together with known split ra- 
tios associated with the arcs of the network. Hence, for example, flow on arc (2,6) is equal to 10 (i.e., 
/ 2,6 = 10) and flow on arc (4,5) is equal to 5 (i.e., = 5). The net flow at vertices 1,2,3, and 6 is 

equal to zero; vertices 4 and 5 are OD vertices with balancing flows equal to S 4 = 10 and S 5 = —10, 
respectively. We have, for example, F( 1) = 30, /x , 3 = F(l) • p -13 where px ,3 = |, and, / 1.2 = F(l) • p 1>2 
where pi >2 = When locating a counting sensor on a vertex we assume that we know the flow volumes 
on all the arcs incident to the vertex. Suppose we locate a counting sensor on vertex 1, that is, flows 
/i ,2 = 10, / 2 ,i = 20 /13 = 20, = 10 are directly monitored and become known. We show now how 

these monitored flows, together with the knowledge of the split ratios, allow us to compute the flows on all 
the remaining arcs of the network. By using the split ratio equation (4) , it is possible to define a system 
of linear equations where (i) a single unknown variable, representing an outgoing arc flow, is associated 
with every vertex, except the monitored vertex and its adjacent ones, and (ii) equations correspond to 

*The split ratio associated with the outgoing arc ( v,w ) specifies the fraction 0 < p v>w < 1 of the outgoing flow F(v) 
that leaves vertex v the arc (v,w). 
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flow conservation constraints relative to every vertex of the network. Additional variables S v , denoting 
the balancing flows of OD vertices, are also included in the linear system. Note that the flow conservation 
equation corresponding to the measured vertex 1 does not contain any unknown flow, and thus it can be 
omitted from the system. When vertex 1 is monitored, we obtain the following system corresponding to 
the flow conservation constraints of the network: 


vertex 2 (/ 2) i + / 2> 6 + A, 4 ) 

vertex 3 (/ 3) i + / 3i5 ) 

vertex 4 (s 4j2 + s 4i2 • 

vertex 5 (s 5 ,4 + ss ,4 • ^ + $ 5,4 • 
vertex 6 (s 6 , 5 • ^ + s 6 , 5 ) 


- (A, 2 + «6,5 • + 54,2) = 0 

- (/i*+« m : S9 = 0 

(«M + A, 4) = ^4 

- (A, 5 + «4,2 ' ^ + 06, &) = $5 

- (A,6 + *5,4^) = 0 


(5) 


where: the unknown variables are s 4)2 , ss i4 , se,s, S 4 and £ 5 ; flows A , 2 - /a, 1 .) A, 3> A,i are directly 
monitored; and flows / 2>4 , A, 6 and A, 5 are obtained using split ratios. The remaining unknown flows are 
obtained using equation (1). In this system, the number of variables is equal to the number of equations, 
the underlying coefficient matrix has full rank; therefore, the system has a unique solution. Of course, 
different systems of equations are associated with different sets of vertices. We refer to a subset M of 
vertices whose associated modified flow conservation system has a unique solution as a monitoring set. 
Each monitoring set then represents a set of vertices where sensors should be installed on the network so 
that all the arc flows can be inferred. Due to the high cost of these devices, minimizing the total cost is 
equivalent to minimizing the total number of devices to be installed. Hence, the arising location problem 
consists of finding a monitoring subset MC JV of minimum size (the Sensor Location Problem - SLP). 


The SLP is NP-complete as proved in [4]. The next section presents some known results that will be 
useful in the subsequent sections of the paper. In particular, we will give a brief description of some 
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Figure 2: An example network with seven vertices. 


combinatorial properties of the problem, its formal definition and an existing algorithm for finding a 
lower bound on the solution value. 


2.1 Known results about SLP 

Before presenting the needed results, we introduce some additional definitions. 

Given a subset of vertices M C N, we denote by Adj(M) the subset of vertices that are adjacent to one 
or more of the vertices in M. Consider the network of Figure 2, with 7 vertices and 16 arcs. If M = {1}, 
then the set of adjacent vertices is Adj(M) = {2, 6, 7}. A combined cut C M C A associated with M is the 
subset of arcs in G induced by M U Adj(M). Hence, in our example, the combined cut associated with 
M = {1} is C M = {(1, 2), (2, 1), (1, 6), (6, 1), (1, 7), (7, 1), (7, 6), (6, 7)}. Let us denote by G\ A! the graph 
obtained from G after deleting the subset of arcs A' C A, and, when no confusion may arise, by G\N' 
the graph obtained from G after deleting the subset of vertices N 1 C N. Let Gf = (N[ J , Af) denote 
the f-th connected component of G\D (sometimes denoted simply G, = ( IV) , A,)). Let Bi and Adj(M)i 
be the set of OD vertices and adjacent vertices in the generic i-th component, respectively. Consider 
again the network in Figure 2 and assume there are two OD vertices on the network, say vertex 4 and 
vertex 7, that is, B = {4, 7}. The graph G\Cm obtained from G after deleting the combined cut Cm 
associated with M = { 1} has three connected components: the connected component G\ with vertex set 
N\ = {2, 3, 4, 5, 6}, a degenerate component G% whose vertex set is N? = {7} and a degenerate component 
G3 with vertex set N 3 = {1}. Note that each vertex v G M will form a degenerate component of G\Cm , 
and all the remaining components of the graph do not contain (by construction) vertices of M. Let us 
refer to the degenerate components containing vertices of M as the monitored components and to all the 
other connected components as the not monitored components. Hence, in our example, component G i is a 
not monitored component and contains two vertices of Adj(M), which are Adj{M)\ = {2, 6}, and one OD 
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vertex: that is Bi = {4}. Component G 2 is a not monitored component such that Adj(M) 2 = B 2 = {7}, 
and component G 3 is a monitored component with Adj(M) :i = B :i = 0. If G is a bi-directed graph and 
M is a monitoring set, then each not monitored connected component Gi in G\Cm is such that the 
total number of OD vertices in G, is not greater than the total number of adjacent vertices in the same 
component as stated in the following proposition by Bianco et al. (for details of proof see [5]). 

Proposition 1 Let G be a bi-directed graph. If M C N is a monitoring set, then 

\Bi\<\Adj(M)i\ Vi = l,2,...,q. (6) 

where q is the number of not monitored connected components of G\ Cm ■ 

Consider again our network example in Figure 2 with M = {1} and B = {4, 7}. Clearly, the non moni- 
tored connected components of G\Cm satisfy the necessary condition stated in Proposition 1. Note that, 
if vertices 2 and 6 were also OD vertices, then condition (6) would not be satisfied; indeed the connected 
component G\ would contain 2 adjacent vertices and 3 OD vertices. Then, in this case, we can say 
M = {1} is not a monitoring set, that is, the associated system does not admit a unique solution. 

By the definition of connected component, every pair of vertices in the component is connected by a 
path. Therefore, condition (6) can be restated in terms of paths connecting OD vertices and adjacent 
vertices, as stated in the following proposition by Bianco et al. [4] . 

Proposition 2 Let G be a bi-directed graph. If M C N is a monitoring set, then there exist in G, 
| B \ M | number of MB-paths distinct in the origin and in the destination. 

where an MB-path is a path P starting from a vertex in Adj(M) and ending at an OD vertex such that: 
(Cl) P does not contain two consecutive vertices that belong to Adj(M)-, 

(C2) P does not contain vertices of M. 

Referring again to Figure 2 where M = {1} and B = {4, 7}, we could define the \B\M\ = 2 MB-paths: 
Pi = {2,3,4} and P 2 = {7}. Again note that, if we assume vertices 2 and 6 are also OD vertices, then 
we cannot find a sufficient number of MB-paths that are distinct in the origin and in the destination. 
It is worth noting that condition (6) (and hence the condition stated in Proposition 2), is necessary but 
not sufficient. Indeed, assume on the network in Figure 2 that vertices 4 and 5 are OD vertices, that is, 
B = {4,5}. Assume we locate a counting sensor on vertex 6, that is M = {6}. Then the combined cut 
is Cm = {(6, 7), (7, 6), (6, 1), (1, 6), (1, 7), (7, 1), (6, 3), (3, 6)}, for which we have three connected compo- 
nents: the not monitored component G i with vertex set N- t = {1, 2, 3, 4, 5}, the not monitored component 



Step 0. (Initialization) M = 0; 

Step 1. Sort N, namely, arrange the vertices in N according to decreasing value of <j(v). 
Step 2. Choose the head vertex in N, remove it from N and put it into M. 

Step 3. If YlveM a ( v ) — l-®l > then STOP. Else, goto step 2. 


Table 2: Algorithm LB. 

C ?2 with vertex set N -2 = {7} and the monitored component G 3 with N 3 = {6}. Component G 2 trivially 
satisfies Proposition 1. In G 1 , the total number of adjacent vertices is \Adj(M) 2 \ = 2 and the total num- 
ber of OD vertices is |i? 2 | = 2; hence, also this connected component satisfies condition (6). Moreover, 
we can define \B\M\ =2 MB-paths, Pi = {1, 2, 3, 4} and P 2 = {3, 5}, that are distinct in the origin and 
in the destination. Hence, Proposition 2 is satisfied too. However, it can be verified that the system as- 
sociated with the set M = {6} does not admit a unique solution, and therefore M is not a monitoring set. 

The necessary condition stated in Proposition 1 and Proposition 2 can be used to define a simple greedy 
procedure to compute a lower bound on the optimum solution of the problem. In particular, the LB 
algorithm presented in [5] is a greedy algorithm that selects vertices in v £ N according to a decreasing 
order with respect to their degree S(v), chooses the one with the highest number of adjacent vertices 
(plus one if the vertex is an OD vertex) and places a sensor on it. Vertices are selected until condition 
(6) is satisfied on the entire graph without taking into account the connected components induced by the 
combined cut C M ■ The pseudocode of the algorithm is given in Table 2 where parameter a(v) is equal 
to S(v) + 1 if v is an OD vertex and is equal to S(v) otherwise. 

For example, for the graph in Figure 2 with B = {4, 5}, the lower bound obtained by applying the LB 
algorithm is 1, which is, in this simple case, also the optimum solution. 

The minimum cardinality subset of vertices satisfying condition (6) can be obtained by solving the math- 
ematical formulation provided in the next section. It is a flow commodity formulation that looks for a 
subset of vertices M satisfying Proposition 2. 

3 Mathematical Formulation 

The mathematical formulation we propose is a single commodity flow formulation where each OD vertex 
requires a unit of flow that originates from a vertex in Adj(M). In particular, we consider three different 
sets of variables: (i) binary variables z v = 1 if a sensor is located at vertex v and z v = 0 otherwise; (ii) 




binary variables y v = 1 if a sensor is either located on vertex v or if vertex v is adjacent to a vertex where 
a sensor is located and y v = 0 otherwise; (iii) flow variables x VtW associated with each arc of the graph. 
The provided mathematical formulation is the following: 


E 

eFS(v) 
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( 11 ) 
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0 

Vv£ N\B 
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V ( v , w) £ A 

( 13 ) 

Z V £ 

{ 0 , 1 } 

Vv£N 

( 14 ) 

Vv £ 

{ 0 , 1 } 

Vv£N 

( 15 ) 

x v ,w > 

0 

V ( v , w) £ A 

( 16 ) 


The objective function (7) seeks to minimize the total number of selected vertices. Constraints (8)-(9) 
ensure that variables y v exactly define vertices that are selected or that are adjacent vertices: y v is equal 
to 1 if either vertex v is selected or at least one of its neighbors is selected. In particular, when neither 
v is selected (that is, z v = 0) nor one of its adjacent vertices (that is, J2 w eFS(v) z *» = 0), constraints (8) 
become active and force y v = 0. On the other hand, constraints (9) become active and impose y v = 1 
when the left side is greater than or equal to 1; these can happen into two cases: either when z v = 1 and 
J2 w eFS(v) z w — 0 °r when z v = 0 and J2 W £FS(v) z w 1- Constraints (10) - (13) ensure the definition 
of MB-paths connecting vertices in Adj(M) with vertices in B. In particular, each MB-path is defined 
by sending one unit of flow from the adjacent vertex where it starts to the OD vertex where it ends; 
hence, one unit of flow must be originated from any adjacent vertex that constitutes the beginning of an 
MB-path and must be absorbed by an OD vertex that constitutes the end of an MB-path, all the other 
vertices in the graph must have a zero balancing flow. For any OD vertex v (i.e., v £ B), the net flow can 
be equal either to 0 or to -1 according to the role of the vertex. Indeed, the following cases may occur for 
any v £ B: (i) vertex v is selected, (ii) vertex v is an adjacent vertex, (iii) vertex v is neither selected nor 
it is an adjacent vertex. In the first case, vertex v is part of a degenerate monitored connected component 
and it does not constitute the end of any MB-path, hence, it does not absorb any flow unit and its net 
flow must be equal to zero; in the second case, being v £ B n Adj(M), vertex v can be considered both 
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the starting point and the end point of an MB-path, and hence, it both generates and absorbs one unit 
of flow and the net flow results equal to zero; finally, in the third case, vertex v must be the end of an 
MB-path, and hence, it must absorb one flow unit and the net flow in this case must be equal to -1. 
These three cases are handled through constraints (10) that ensure the flow balance at any vertex v £ B 
is equal to zero flow units if either v £ M or v £ Adj(M) (that is, y v = 1), and the flow balance is equal 
to -1 otherwise (that is, y v = 0). Flow conservation constraints for the other vertices are expressed by 
(11) and (12). If vertex v £ Adj(M) (i.e., y v = 1), then it may or may not be the origin of an MB-path, 
and hence, it may or may not originate one unit of flow: constraints (11) and (12) ensure the net flow 
on the vertex is less than or equal to one; on the other hand, if v ^ Adj(M) (i.e., y v =0), then it is a 
transfer vertex and constraints (11) and (12) force the net flow on the vertex to be equal to zero. Finally, 
constraints (13) ensure that each MB-path is such that it contains neither (i) arcs incident to selected 
vertices (condition Cl) nor (ii) arcs connecting two vertices both belonging to Adj(M) (condition C2). 
Indeed, the flow variable x VjW associated with arc (v, w) is forced to be zero whenever either (i) v € M 
and w £ Adj(M) or (ii) v € Adj(M) and w £ Adj(M). 

We outline that flow variables x VtW are used here to define the MB-paths required to satisfy the condition 
stated in Proposition 2. These variables have nothing to do with actual trips on the network which are 
represented by variables s v/w and S v in the modified conservation system associated with the subset of 
selected vertices. The actual trips on the network can be obtained by solving this modified conservation 
system when it results to be full rank. 

4 Our solution approaches 

In this section we describe our proposed solution approaches for the problem. In particular, we developed 
a Branch and Bound approach and a genetic algorithm to solve SLP. The feasibility test implemented 
for both of our solution approaches takes into account the combinatorial rule suggested by condition (6). 
In particular, given a subset of vertices M, two cases can occur: 

• If property (6) is not satisfied, then we do not need any other verification: M is not a monitoring 
set. 

• Otherwise, if property (6) is satisfied, then the current solution might be a monitoring set; hence, 
we build the associated modified flow conservation system and check whether it admits a unique 
solution or not. 

The methodology adopted for building the modified flow conservation system associated with a subset of 
vertices is the one proposed in [21] and [22] and it is reported in the appendix. 
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4.1 The Genetic Algorithm 

A genetic algorithm is a randomized search optimization heuristic, which is based on the biological process 
of natural selection and can be applied to a variety of combinatorial optimization problems. Genetic al- 
gorithms mimic the evolutionary process by processing a population of solutions simultaneously. Starting 
with solutions that are typically created randomly, better ones are more likely to be chosen for recombi- 
nation with others to form new solutions. Thus new solutions inherit good parts from old solutions. In 
addition to recombining solutions, new solutions may be formed through mutating or randomly changing 
old solutions. The process is repeated until stopping criteria are met. In this way, good solutions or 
also optimal ones could be reached. The main components of our algorithm are described below while the 
pseudocode is given in table 3. 

Solution Representation 

The generic chromosome used to represent a solution is a vector with a number of components equal to 
\B\ (that is, the total number of OD vertices for the instance being considered, which is also an upper 
bound on the optimal solution value). The generic component of the chromosome (the gene) can assume 
an integer value in the interval [1, n] where n = |iV| is the total number of vertices in the graph. 
Fitness Function 

The fitness function associated with each chromosome is based on the total number of different genes 
in the chromosome (i.e. the total number of located sensors |M|) and the level of infeasibility of such a 
choice (dependent on the number of not monitored connected components G , , induced by the combined 
cut Cm, where property (6) is not satisfied). In particular, let us suppose the selected sensors define the 
set M and let q be the number of not monitored connected components induced by the combined cut 
Cm where the necessary condition (6) is not satisfied. In this case, the fitness function value associated 
with M is: 

{ \M\ if q = 0 and M is a monitoring set 

\M\ + |B| if q = 0 and M is a not monitoring set 

\M\ + |f?| * (h+ 1) otherwise 

where 

h = £ m - \Adj(M)i\) 


For example, let us consider the graph in Figure 2 with 7 vertices, and suppose there are four OD ver- 
tices, i.e., B = {2,4, 6, 7}. We would have a chromosome with four elements, and each element could 
take a value in the interval [1, 7]. Chromosome CHi = [2, 4, 6, 1] represents a feasible solution (that is, a 
monitoring set) where vertices 1, 2, 4 and 6 are selected; the corresponding value of the fitness function 
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Step 1. Randomly generate p chromosomes to serve as the initial population; set i = 0. 

Step 2. Build generation i + 1 from generation i as below: 

- Apply the crossover and mutation operator on the population Pi 
to generate p chromosomes in population P l+ i . 

- Eliminate the y% worst chromosomes and substitute them with the 
best chromosomes of population 

Step 3. Repeat Step 2 until i < Maxlteration. 

Step 4- Output the best solution obtained so far. 


Table 3: Genetic Algorithm: Overview. 

is /({1,2,4, 6}) = 4. Chromosome CH 2 = [1,1,1, 1] represents an infeasible solution where vertex 1 is 
selected and the corresponding fitness function value is equal to /({l}) = 1 + 4 * (1 + 1) = 9. 
Crossover Operator 

The crossover operator generates two new chromosomes (children) starting from two chromosomes (par- 
ents) belonging to the actual population. That is, two parent chromosomes, say CHi and CHj, are 
randomly selected from the actual population. The two children CH qi and CH q2 are generated as fol- 
lows. A position h £ [1, |B|] is randomly generated, and the genes from position h up to position h + k 
(where k is a previously specified parameter) are exchanged between the two parent chromosomes to gen- 
erate the two children. For example, in the above example with parents chromosomes CH 4 = [2, 4, 6, 1] 
and CH 2 = [2, 2, 1, 5], by letting h = 2 and k = 1 the two generated children would be CH :i = [2, 2, 1, 1] 
and CH 4 = [2, 4, 6, 5]. 

Mutation Operator 

The mutation operator makes small random changes in the individual of the population, which provides 
genetic diversity and enables the GA to search a broader space for the problem. For each newly gener- 
ated child, we adopted a fixed mutation scheme that changes a single randomly selected gene with a new 
randomly generated value. 

Population Size and Initial Population 

We chose to generate populations whose size is fixed and equal to a parameter p. All the p chromosomes 
that constitute the initial population are randomly generated. 

Population Reproduction Operation 

Let Pi be the actual population. Then the new population P i+ 1 is generated as follows: (i) generate p 
chromosomes by applying the crossover and mutation operator, (ii) replace the y% worst chromosomes 
with the best chromosomes of population Pi. 
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In our implementation of the algorithm, we set the size of the population to p = 100. The percentage 
y in the reproduction operation is equal to 50%. The algorithm stops when the computational time is 
greater than n seconds. 

4.2 The Branch and Bound Approach 

In this section we describe our branch and bound algorithm to find an exact solution to the problem. 
The search tree is a binary search tree formed by at most n levels, each corresponding to a vertex of the 
graph. Therefore, at level k of the search tree, we have 2 k decision nodes since the branch rule consists 
of either choosing or not choosing vertex k in the graph to be part of the solution. 

The LB algorithm (described in Section 2.1) is used inside the Branch and Bound routine to find at each 
node of the search tree a lower bound on the solution value. In particular, such a procedure has been 
modified (see Section 4.2.1) to solve the following constrained version of the SLP: 

Constrained Sensor Location Problem ( CSLP ) 

INSTANCE: A bi-directed graph G = (IV, A), a subset B C N, two disjoint subsets of vertices / C N 
and O C N and a positive integer K < \B\ . 

QUESTION: Is there a monitoring set MCA of vertices with \M\ < K, such that I C M and 
MnO = 0 ? 

A CSLP is associated with each decision node of the search tree, say node h, and two sets, say A 
and Oh, represent the partial solution built so far. The subset A contains vertices of the graph that 
have already been selected to be part of the partial solution, and the subset Oh contains vertices of the 
graph that have been designated as not being part of the partial solution, according to the choices made 
in the path connecting the root node of the search tree to node h. In particular, let us define a set of 
binary variables a q associated with each vertex i € N of the graph whose values are such that a*,; = 1 if 
vertex i is selected and aq = 0 otherwise. Consider now, for example, the partial binary search tree in 
Figure 3 associated with the graph of Figure 2. The first level of the tree corresponds to vertex 1 of the 
graph which can be chosen to be part of the solution (aq = 1) or not (oq = 0); the second level of the 
tree corresponds to vertex 2 of the graph and so on. An instance of the CSLP is associated with each 
node of the tree according to the values of the variables fixed so far. For example, the two disjoint sets 
Jr = {1,4} and Or = {2,3} are associated with node 7 of the search tree. 

The branch and bound scheme consists of a sequence of forward moves and backtracking moves. A for- 
ward move selects a vertex of the graph to be part of the solution (by setting the corresponding variable 
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vertex 1 


vertex 2 


vertex 3 


vertex 4 



Figure 3: The partial binary search tree associated with the graph in Figure 2. 



Xi = 1), while a backtrack operation removes the last selected vertex from the current partial solution 
(by setting the corresponding variable aq = 0). At a generic node of the search tree, a forward move is 
performed if the current partial solution is not feasible and the lower bound computed at the node is 
strictly less than the incumbent feasible solution value. Otherwise, if the solution is feasible, a backtrack 
is carried out and the node is pruned. When a feasible solution is achieved, the node is pruned and 
the incumbent solution is could be updated. The algorithm stops when no further backtracking can be 
performed. 

In particular, the lower bound associated with each decision node (used to carry out a pruning oper- 
ation) is obtained by running the procedure modified-LB, which is described next in this section (see 
Table 5). A feasible initial solution is also associated with the root node; this is the solution provided 
by the genetic algorithm described in Section 4.1. To check the feasibility of the current solution we first 
verify whether property (6) is satisfied. If this is not the case, then the current solution is considered 
unfeasible. On the contrary, if property (6) is satisfied the current solution might be feasible and, to 
check that, we build the associated modified flow conservation system and check whether it admits a 
unique solution. 

The pseudocode of the algorithm is given in Table 4 where the following notation is used: 
x = vector of the current solution (£',; denotes the <-th component); 
z = current solution value (= JAb l *i)j 

x = vector of the best solution so far (a:,; denotes the i-th component); 

z = best solution value so far (= Ya= i x i)i 

LB = value of the lower bound computed at the actual node; 

4.2.1 The Modified-LB 

The modified-LB algorithm, which is used to find a lower bound to the CSLP, is given in Table 5. 
We recall that the CSLP differs from the SLP in that a subset J C N of vertices has already been 
designated to be selected, while a disjoint subset O C N of vertices has been designated not to be part 
of the solution. The main idea of the algorithm consists of applying the LB procedure to each connected 
component of the graph G' = G\I resulting from G after the deletion of the vertices in I and after 
updating the degree of each vertex according to the set Adj(I). In particular, for each (non- degenerate) 
not monitored connected component G' % of G' . the updated degree of each vertex v G N\I counts the 
number of vertices in the component adjacent to v and not adjacent to any vertex in I of the component, 
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Step 0. (Initialization) 
x = 0; 
z = 0; 

x and z are returned by the genetic algorithm; 


Step 1. If x is feasible then: 

- update x and z\ 

- Xi = 0 ; 

- Goto Step 3; 

If LB < z then Goto Step 2 else Xi = 0. 

Step 2. (Forward move) 
i = i+ 1; 

Xi = 1 ; 

Goto Step 1. 

Step 3. (Backtrack move) 
k = i ; 

while k > 1 do 
if Xk = 1 then: 

- x k = 0 ; 

- i = k; 

- Goto Step 2; 
k = k — 1; 

end while. 

Step 4 ■ (End of Algorithm) 

Return x and z. 


Table 4: Branch and Bound Algorithm: Overview. 
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Step 0. (Initialization) 

Let G' = ( N’,A ') be the subgraph of G induced by the set of vertices TV' = TV\7; 
Let G' = (TV', At), be the set of connected components of G', i = 1, 2, . . . , k: 

Set Mi = 0 for each i. 

Step 1. For each connected component G" ( do 

1.1. Compute 6(v) = \(Adj(v)\Adj(I )) fl TV|, for each v G TV/; 

1.2. Compute k = \B n TV'| - \Adj(I) n TV'|; 

1.3. Sort the vertices in TV/\0 according to the decreasing values cf{v)\ 

1-4 • While J2veM t < h do 

1.4.1. Select the head vertex in TV/, remove it from TV/ and put it into M,. 
Step 2. Return M = U t - TV/,- . 


Table 5: Modified-LB algorithm to compute a lower bound on the solution value of the CSLP. 


that is, S(v) = \(Adj(v)\Adj(I)) fl TV'|. Moreover, the size of the set of OD vertices in each component is 
decreased by the total number of vertices in the component that are adjacent to vertices in /, that is, the 
quantity \B fl TV/| decreases to S, — |Bn TV/[ — | Adj(I) (~l TV/|. All the vertices in the component, except 
those in O, are sorted in decreasing order according to the value &(v), that is, equal to S(v) + 1 if vertex 
v is an OD vertex and is equal to S(v ) otherwise. Vertices are then chosen to be part of the selected set 
Mi until J2veMi d( y ) > h- The final lower bound is given by summing up all the lower bounds obtained 
in each component, that is, \M\ = JV |Mj|. 


5 Computational results on benchmark instances 

To experimentally evaluate the algorithms, we test them on the benchmark instances described in [11] 
where two sets of graph classes were defined: complete grid graph scenarios and random grid graph sce- 
narios. We also run our algorithm on the real traffic network of the city of Waynesboro, Virginia, USA. 
All the instances can be downloaded from the authors’ website [1], [2]. All tests were performed on a 
workstation with Intel Core 2 Duo processor at 2.4Ghz and 3GB of RAM. All the procedures have been 
coded in C+- 1-. Library Lapack was used to compute the rank of a matrix. The mathematical model was 
solved using the AMPL language with IBM ILOG CPLEX 12.1. A time limit of one hour was set for 
the branch and bound algorithm, and a time limit of 10 minutes was set to solve the mathematical model. 

Complete Grid Graphs Scenarios 

There are complete grid graphs of dimension 6 x 6, 8 x 8, 10 x 10, 15 x 15 and 20 x 20. The distribution 
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of the OD vertices was generated from a uniform stochastic variable by considering the total number of 
OD vertices to be equal to |£| = n/2 and |B| = n/4, where n is the number of vertices in the grid. The 
split ratios associated with the outgoing arcs of each vertex v are equal to 1/S(v). For each grid, and a 
given number of OD vertices, the reported results are the average size of the returned set M on 10 runs, 
for a total of 120 instances. 

Modified Grid Graphs Scenarios 

Starting from complete grid graphs, new random graphs with different arc densities were generated. In 
particular, given a complete grid graph G = ( N,A ), the value n is the probability that each arc be- 
longs to the new graph. If the resulting graph is not connected, a simple procedure is performed to 
add arcs between vertices of different connected components. Five different complete grid graphs were 
considered whose size are, 6 x 6, 8 x 8, 10 x 10, 15 x 15 and 20 x 20. For each modified grid obtained 
with 7 r € {0.01,0.03,0.04}, the distribution of the OD vertices was generated from a uniform stochastic 
variable by considering the total number of OD vertices to be equal to |i?| = n/2 and \B\ = n/4, where 
n is the number of vertices in the grid. The split ratios associated with the outgoing arcs of each vertex 
v is equal to 1 /8(v). For each grid, each possible value of 7 r and a fixed the number of OD vertices, the 
reported results are the average size of the set M on 10 runs, for a total of 360 instances. 

Table 6 and Table 7 show the results obtained on the complete grid graph scenarios and on the modified 
grid graph scenarios, respectively. All the computational times are expressed in seconds. Column n re- 
ports number of vertices, column b reports number of OD vertices, and column 7r contains the probability 
values for arc generation. Columns BB Sol. and BB Time contain the solution and the computational 
time for our Branch and Bound algorithm. Columns Genetic Sol. and Genetic Time contain the solution 
and the computational time for the genetic algorithm. The lower bound provided by the LB procedure 
is shown in column LB, the corresponding computational time is reported in column LB Time. Finally, 
the last six columns of each table show the lower bounds obtained by solving our mathematical model. 
In particular, columns Integer Sol. and Integer Time contain the optimum solution of the model and 
the corresponding computational time required by the solver. We outline that the optimum solution of 
the mathematical formulation provides a lower bound on the optimum solution value of the SLP. Indeed, 
the formulation seeks a subset of vertices M of minimum size such that a necessary condition for M to 
be a monitoring set is satisfied. However, since the condition is not sufficient, the resulting set could 
not be feasible for the original problem. We also obtained two additional lower bounds by considering 
two relaxations of the model: the mixed integer relaxation obtained by relaxing the integer requirement 
on variables y (that is, constraints (15)) and the linear relaxation obtained by relaxing the integer re- 
quirement both on variables y and z (that is, constraints (14) and (15)). The corresponding results and 


19 



computational time are reported in columns Mixed Sol. and Mixed Time for the mixed integer relaxation, 
and in columns Relaxed Sol. and Relaxed Time for the linear relaxation. The upper part of both the 
tables shows the results obtained by setting the number of OD vertices b equal to | , the lower part of the 
tables contains the results for values of b equal to j. The reported results are the average value of ten 
instances. When the symbol ’*’ appears in the BB Sol. column, it means that in at least one instance 
of the scenario the time limit of 3600 seconds was reached. When the symbol ’*’ appears in the Integer 
Sol. or Mixed Sol. column, it means that in at least one instance of the scenario the time limit of 360 
seconds was reached. 

The branch and bound approach has negligible computational times on small instances (i.e. n = 36, 64), 
both for the regular grids and for the modified grids. It reaches the time limit on all the 10 instances 
of a scenario when n = 225, 400 for both the class of grids, and also when n=100 and the number of 
OD vertices is equal to 50. This reveals that both the size of the graph and the number of OD vertices 
determine the difficulty of the scenario, as was expected. 

Our genetic algorithm always finds the optimum solution when this was known. Moreover, the solution 
returned by the BB when time limit is reached is the same solution returned by the genetic algorithm. 
Hence, all the time spent by the BB approach is used to test optimality of the initial solution provided 
by the genetic algorithm. Computational times of the genetic algorithm are proportional to the number 
of vertices. Hence, this approach is slow compared to the exact branch and bound on the small instances, 
while it outperforms the exact approach on more difficult and larger scenarios, but its computational 
time tends to increase rapidly with the size of the instance. The main drawback of both approaches lies 
in the feasibility test of a solution. Indeed, to test whether a given set of vertices is a monitoring set, 
the rank of a square matrix has to be computed by means of the Lapack library and the computational 
time of the algorithms, given that such a feasibility test is carried out several times for a huge matrix, 
increases very fast as the number of vertices and the number of OD vertices increase. This result was 
somehow expected and shows the importance of defining necessary and sufficient conditions to test for 
feasibility of a given subset of vertices that make use of the combinatorial nature of the problem, and the 
importance of the underlying structure network, so that more efficient algorithms can be designed. 

Let us compare the quality of the lower bounds returned by the three mathematical models and by 
the LB algorithm. First of all, observe that the computational times for the LB algorithm are always 
negligible compared to the computational times required by the solver to solve any of the three math- 
ematical models, both for regular grids and for modified grids. Analyzing the results on regular grids 
more in detail (Table 6), we observe that the solution value returned by the LB algorithm and the opti- 
mum solution of the mathematical model are the same for the complete grid graph scenarios, and such 
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a solution has a value equal to the solution returned by the genetic algorithm on instances with n = 36. 
This explains the negligible time of the BB algorithm on these scenarios: in most of these cases, it does 
not even generate any node in the search tree, since it finds the optimum solution on the root node. The 
mixed and linear relaxations of the models require less computational time. The solution returned by the 
Mixed relaxation is always equal to the optimum solution of the model (except for two instances with 
n = 400 and b = 200 where the time limit of 360 seconds was reached). 

Let us analyze the quality of the lower bounds on the modified grid graph. Consider Table 8 where 
we reported the percentage gaps of the solution values among different approaches. In particular, each 
column Ai — > A 2 is computed as Al ^ i A2 '■ if the gap is negative then the solution value returned by A, 
is less that the solution value returned by A 2 ; otherwise, it is greater. The smaller the absolute value of 
the gap, the more the two solutions are similar. When the mathematical model (either when solved to 
optimality or when the mixed or linear relaxations are considered) reached the time limit, the gap cannot 
be computed and the symbol NA is reported. 

We can observe that the lower bounds returned by the LB algorithm and the lower bounds returned 
by the solution of the mixed relaxation of the model are similar (see column LB— Mixed). Analyzing the 
values on column Integer— > LB, we can see the lower bound of LB procedure is obviously always smaller 
than the optimum solution of the mathematical model. The difference is smaller on those scenarios with 
7 r = 0.01, that is, those scenarios that are more similar to the complete grid scenarios, as was expected. 
The maximum gap is equal to 33% and it is reached when n = 100 and 7r = 0.04. We can also observe 
that in those scenarios with the same number of vertices, the lower bound is worse when the number of 
OD vertices in greater, as was expected. 

6 Computational results on a real worls network 

We also ran our algorithm on the real traffic network of the city of Waynesboro, Virginia, USA. The 
entire network was imported from the OpenStreetMap website [3]. The network has 1306 vertices and 
3532 arcs. We randomly selected 100 vertices as OD vertices. Figure 4 shows the network where the black 
vertices are OD vertices. The genetic algorithm was stopped after a time limit of 1 hour and returned a 
feasible solution equal to 40. The Branch and Bound algorithm was not able to improve such a solution 
after three hours of computation. The LB algorithm returned a lower bound equal to 20 in 0.0 seconds. 
The lower bound returned by the mathematical model was equal to 20 and the required computational 
time was equal to 4.65 seconds. The mixed and the linear relaxation returned solutions equal to 20 and 
19.8 in 0.2 and 0.0 seconds, respectively. 
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7 Conclusions 


In this paper we extend the study of a known NP-complete optimization problem that finds application 
in traffic management and control: the Sensor Location Problem. The existing contributions in the 
literature about the problem consist of the study of the computational complexity and the definition 
of greedy heuristics to get lower and upper bounds. Our contributions fill some gaps present in the 
literature about the problem: (i) we provide a mathematical formulation for the problem (missing in 
the literature until now), (ii) we developed and tested an exact branch and bound approach to find the 
optimum solution of the problem (iii) we developed and tested a genetic approach to find good quality 
solutions. Our computational results were carried out on existing benchmark instances and show that 
the proposed metaheuristic is very efficient for solving the problem on medium-large instances. Our 
computational study revealed that the weak point of any solution algorithm is the feasibility test, which 
currently requires the computation of the rank of a matrix. Since such a feasibility test is used several 
times in any given algorithm, it is of great relevance, for the designing of more efficient algorithms, to 
better analyze the combinatorial structure of the problem and define necessary and sufficient conditions 
for testing feasibility of a given subset of vertices. This is one of the issues our further study of the 
problem is focusing on. We are also studying interesting variants of the problems. In particular, we 
would like to extend the results considering the weighted version of the problem, that is, when there is a 
different cost associated with the location of a sensor both on directed graphs and on bi-directed graphs. 
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Figure 4: Waynesboro Network, Virginia, USA. 


Table 6: Results Comparison for Complete Grids 


n b BB BB Genetic Genetic LB LB Integer Integer Mixed Mixed Relaxed Relaxed 

Sol. Time Sol. Time Sol. Time Sol. Time Sol. Time Sol. Time 


36 18 4.00 0.00 

64 32 7.00 2.63 

100 50 11 . 3 * 3600.00 

225 112 27 . 2 * 3600.01 

400 200 51 . 8 * 3600.01 


4.00 36.00 4.00 0.00 

7.10 64.01 7.00 0.00 

11.30 100.06 10.00 0.00 

27.20 228.15 23.00 0.00 

51.80 440.18 40.00 0.00 


4.00 0.10 4.00 

7.00 0.24 7.00 

10.00 0.91 10.00 

23.00 10.50 23.00 

40.00 34.21 40 . 2 * 


0.01 3.60 0.01 

0.07 6.40 0.01 

1.51 10.00 0.03 

0.81 22.40 0.11 

167.40 40.00 0.36 


36 9 2.00 0.00 

64 16 4.00 0.00 

100 25 5.00 0.33 

225 56 13 * 3600.01 

400 100 24 . 9 * 3601.87 


2.00 36.00 2.00 0.00 

4.00 64.08 4.00 0.00 

5.20 100.07 5.00 0.00 

13.00 234.52 12.00 0.00 

24.90 840.65 20.00 0.00 


2.00 0.03 2.00 

4.00 0.14 4.00 

5.00 0.69 5.00 

12.00 2.69 12.00 

20.00 16.98 20.00 


0.02 1.80 0.01 

0.07 3.20 0.01 

0.34 5.00 0.02 

0.64 11.20 0.10 

19.80 20.00 0.29 


Appendix 

A subset M of vertices is a monitoring set if and only if the associated modified flow conservation system 
has a unique solution, that is, the coefficient matrix Em, of such a system has full rank. In this section, 
we describe the procedure for building the modified flow conservation system, and therefore matrix E M , 
associated with a subset M of vertices. The description and definitions given here are the same as those 
given in [21] and [22]. 
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Table 8: Percentage Gap Comparison for Modified Grids 



Step 3 Build a square matrix H = {hij} with n rows and n columns as follows: 

{ aij if vertex i and vertex j are connected 

-J2 w eA-ti)^i if *=J 

0 otherwise 

Note that each row and each column of matrix H are associated with a vertex of the graph. 

Step 4 Build the matrix C = {c,j} with n rows and |2?| columns, where each row is associated with a verte: 
and each column is associated with an OD vertex in the graph. The generic column of matrix C 
associated with the OD vertex i is such that all its elements are equal to 0 except the elemen 
corresponding to row i, which is equal to 1. 

Step 5 Let E = [H\C\ be the adjunct matrix obtained by appending the columns of matrix C to th 
columns of matrix H. 

Step 6 For each vertex i £ M do the following: 


delete from E the row corresponding to vertex i: 




- delete from E the column of submatrix H corresponding to vertex i: 

- if vertex i £ B, delete from E the column of submatrix C corresponding to the OD vertex i; 
Step 1 For each vertex j £ Adj(M) delete from E the column of submatrix H corresponding to vertex j; 
Step 8 Let Em be the resulting matrix. 
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